Two claims are amended herein pursuant to a telephone interview held on March 
3, 2003 with Examiner Opsasnick, and a number of new claims are included. 

It is respectfully submitted that all of the claims are patentable in view of known 
prior art. Hence, consideration of the claims and allowance of same are respectfully 
solicited. 



Respectfully, 
Mark Beutnagel 
Joern Ostermann 
Schuyler Opackenbusch 




Dated: u ^7/ B y_ 





Henry TyBrendzel 
Reg. Nd?26,844 
Phone (973)467-2025 
Fax (973)467-6589 
email brendzel@comcast.net 



IN THE CLAIMS: 

1. (Presently Amended) A method for generating a signal rich in prosody 
information comprising the steps of: 

[including] inserting in said signal a plurality of phonemes represented by 
phoneme symbols, 

[including] inserting in said signal a duration specification associated with each of 
said phonemes, 

[including] inserting , for at least one of said phonemes, at least two prosody 
parameter specifications, with each specification of a prosody parameter specifying a 
target value for said prosody parameter and any selected point in time for reaching said 
target value , to thereby generate a signal adapted for converting into speech . 

21. (Presently Amended) A method for generating a signal rich in prosody 
information comprising: 

a first step for [including] inserting in said signal a plurality of phoneme symbols, 

a second step for [including] inserting in said signal a desired duration of each of 
said phoneme symbols, 

a third step for [including] inserting at least one target prosody parameter value 
within a duration for at least one of said phonemes at an explicitly chosen time offset 
from the beginning of the duration of said phoneme that is greater than zero and less than 
the duration of said phoneme. 

Please add the following claims: — 

23. The method for creating a signal responsive to a text input the results in a 
sequence of descriptive elements, including, a TTS sentence ID element, a silence 
duration specification element, if a silence specification is desired; a gender specification 
element, if gender specification is desired; an age specification element, if gender 
specification is desired; a number of text units specification element; and a detail 
specification the text units, the improvement comprising the step of: 

including in said detail specification of said text units 
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• preface information that includes indication of number of phonemes, 

• for each phoneme of said phonemes, an indication of number of parameter 
information tuples, N, and 

• N tuples, each tuple specifying a prosody parameter target value and a selectably 
chosen point in time for reaching said target value. 

24. The method of claim 23 where said text units are bytes of text. 

25. The method of claim 23 where said parameter information tuples relate to 

pitch. 

26. The method of claim 23 where N is an integer greater than 1 . 

27. The method of claim 23 where said preface includes a Dur_Enable indicator, 
and when said Dur_Enable indicator is at a predetermined state, said step of including 
also includes, a phoneme duration value for each phoneme of said phonemes. 

28. The method of claim 23 where said preface includes an FO_Contour_Enable 
indicator that is set at a predetermined state when said signal includes said N tuples. 

29. The method of claim 23 where said preface includes a 
Energy_Contour_Enable indicator, and when said Energy_Contour_Enable indicator is at 
a predetermined state, said step of including also includes, one or more energy value 
parameters. 

30. The method of claim 29 where said energy value parameters specify energy 
at beginning, middle, or/and end of phoneme pertaining to said Energy_Contour_Enable 
indicator. 

31. The method of claim 23 where said preface includes a listing of said 
phonemes. 
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32. A method for generating a signal for a chosen synthesizer that employs text, 
"phoneme, and prosody information input to generate speech, comprising the steps of: 

receiving a first number, M, of phonemes specification; 

receiving, for at least some phoneme, a second number, N, representing number 
of parameter tuples to be received for the phoneme; 

receiving N parameter tuples, each tuple specifying a parameter target value and a 
time for reaching said target value; 

translating said parameter tuples to form translated prosody information that is 
suitable for said chosen synthesizer; and 

including said translated prosody information in said signal. 

33. The method of claim 32 further comprising: 

a step, preceding said step of receiving said second number, M phoneme 
specifications; and 

a step of including in said signal phoneme specification information pertaining to 
said received M phoneme specifications, which information is compatible with said 
chosen synthesizer. 

34. The method of claim 32 further comprising the steps of 
receiving, following said step of receiving said N parameter tuples, energy 

information; and 

including in said signal a translation of said energy information, which translation 
is adapted for employment of the translated energy information by said chosen 
synthesizer. 

RECEIVED 

MAY 0 1 2003 
Technology Center 2600 
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